Parametric Coding of Stereo Audio Based on Principal Component Analysis
نویسندگان
چکیده
Low bit rate parametric coding of multichannel audio is mainly based on Binaural Cue Coding (BCC). Another multichannel audio processing method called upmix can also be used to deliver multichannel audio, typically 5.1 signals, at low data rates. More precisely, we focus on existing upmix method based on Principal Component Analysis (PCA). This PCA-based upmix method aims at blindly create a realistic multichannel output signal while BCC scheme aims at perceptually restitute the original multichannel audio signal. PCA-based upmix method and BCC scheme both use spatial parameters extracted from stereo channels to generate auditory events with correct spatial attributes i.e. sound sources positions and spatial impression. In this paper, we expose a multichannel audio model based on PCA which allows a parametric representation of multichannel audio. Considering stereo audio, signals resulting from PCA can be represented as a principal component, corresponding to directional sources, and one remaining signal, corresponding to ambience signals, which are both related to original input with PCA transformation parameters. We apply the analysis results to propose a new parametric coding method of stereo audio based on subband PCA processing. The quantization of spatial and energetic parameters is presented and then associated with a state-of-the-art monophonic coder in order to derive subjective listening test results.
منابع مشابه
Low Complexity Parametric Stereo Coding in Mpeg - 4
Parametric stereo coding in combination with a State-of-the-Art coder for the underlying monaural audio signal results in the most ef cient coding scheme for stereo signals at very low bit rates available today. This paper reviews those aspects of the parametric stereo paradigm that are important for audio coding applications. A complete parametric stereo coding system is presented, which was r...
متن کاملLow Complexity Decoding in Parametric Stereo Audio Coding Scheme
Parametric Stereo (PS) is an audio coding object of MPEG-4 HE-AAC v2 which utilized the Spatial Audio Coding (SAC) technique to enhance the compressing efficiency. However, the complexity at decoder is higher than that at encoder in PS. In this paper, we proposed a low complexity decoding scheme in PS. To take advantage of SAC, the encoder additionally extracts and transmits the parameters of r...
متن کاملContext-Based Arithmetic Coding Scheme for Parametric Stereo in Enhanced aacPlus
Enhanced aacPlus is an audio codec which is composed of advanced audio coding (AAC), spectral band replication (SBR), and parametric stereo (PS) for efficient audio coding at low bit rates. We propose a new coding scheme for lossless bit rate reduction of PS in enhanced aacPlus. We first determine the optimal contexts for context-based coding of quantized stereo parameter indexes in PS. Then we...
متن کاملParametric Coding of Stereo Audio
Parametric-stereo coding is a technique to efficiently code a stereo audio signal as a monaural signal plus a small amount of parametric overhead to describe the stereo image. The stereo properties are analyzed, encoded, and reinstated in a decoder according to spatial psychoacoustical principles. The monaural signal can be encoded using any (conventional) audio coder. Experiments show that the...
متن کاملPrimary-ambient Extraction in Audio Signals Using Adaptive Weighting and Principal Component Analysis
Most audio recordings are in the form of a 2-channel stereo recording while new playback sound systems make use of more loudspeakers that are designed to give a more spatial and surrounding atmosphere that is beyond the content of the stereo recording. Hence, it is essential to extract more spatial information from stereo recording in order to reach an enhanced upmixing techniques. One way is b...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2006